339 results found.
Written
Lexicon,
Language Type:
Monolingual
Languages:
Afrikaans Albanian Arabic Armenian Bangla Basque Bosnian Breton Bulgarian Catalan Croatian Czech Danish Dutch English Esperanto Estonian Filipino Finnish French Galician Georgian German Greek Hebrew Hindi Hungarian Icelandic Indonesian Italian Japanese Kazakh Korean Latvian Lithuanian Macedonian Malay Malayalam Norwegian Persian Polish Portuguese Romanian Russian Serbian Sinhala Slovak Slovenian Spanish Swedish Tamil Telugu Thai Turkish Ukrainian Urdu Vietnamese pt_br ze_en ze_zh zh_cn zh_tw
Availability:
Freely Available
License:
CreativeCommons Attribution 4.0 International
Size:
41 GByte Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:word2word: A Collection of Bilingual Lexicons for 3,564 Language Pairs
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yo Joong Choe | word2word | /N |
Documentation:
Yes, on the website.
Written
Corpus,
Language Type:
Multilingual
Languages:
Chinese English Japanese Others
Availability:
Freely Available
License:
Size:
353,055 entries Production Status:
Newly created-finished
Use:
Spelling Correction, Grammatical Error Correction
-
Paper title:GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Masato Hagiwara | GitHub Typo Corpus | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
Japanese
Availability:
Freely Available
License:
MIT
Size:
None sentences Production Status:
Newly created-in progress
Use:
Multimedia Document Processing
-
Paper title:Visual Grounding Annotation of Recipe Flow Graph
-
Paper track:Multimodality/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Taichi Nishimura | Recipe Flow Graph Bounding Box Dataset | /N |
Documentation:
Now document is not available.
Written
Corpus,
Language Type:
Monolingual
Languages:
Japanese
Availability:
Freely Available
License:
CreativeCommons
Size:
677 entries Production Status:
Newly created-in progress
Use:
Corpus Creation/Annotation
-
Paper title:Annotation of Adverse Drug Reactions in Patients' Weblogs
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yuki Arase | ADR annotation in patients' weblogs | /N |
Documentation:
This paper
Multimodal/Multimedia
Corpus,
Language Type:
Bilingual
Languages:
English Japanese
Availability:
From Owner
License:
TBA
Size:
10 hours Production Status:
Newly created-finished
Use:
Dialogue
-
Paper title:The AICO Multimodal Corpus – Data Collection and Preliminary Analyses
-
Paper track:Multimodality/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kristiina Jokinen | AICO Corpus | /N |
Documentation:
yes
Written
Corpus,
Language Type:
Monolingual
Languages:
Arabic Azerbaijani Belarusian Bulgarian Catalan Danish English Estonian Filipino Finnish Hindi Hungarian Indonesian Irish Italian Japanese Kazakh Korean Latvian Lithuanian Mongolian Norwegian Polish Portuguese Russian Serbian (Latin) Slovenian Spanish Swedish Tamil Turkish Ukrainian Urdu Uzbek Vietnamese ces deu ell fas fra isl kat mkd nld ron slk sqi zho
Availability:
Freely Available
License:
GNU-GPL v.3
Size:
45 billion words Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:Geographically-Balanced Gigaword Corpora for 50 Language Varieties
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jonathan Dunn | GeoWAC | /N |
Documentation:
https://github.com/jonathandunn/earthlings
Written
Corpus,
Language Type:
Bilingual
Languages:
English Japanese
Availability:
From Owner
License:
Size:
8748 sentences Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
-
Paper title:A Contract Corpus for Recognizing Rights and Obligations
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ruka Funaki | Contract Corpus for Recognizing Rights and Obligations | /N |
Documentation:
None
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
Japanese
Availability:
Freely Available
License:
Size:
45 MByte Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
-
Paper title:Extraction of the Argument Structure of Tokyo Metropolitan Assembly Minutes: Segmentation of Question-and-Answer Sets
-
Paper track:Infrastructural Issues/Large Projects/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Keiichi Takamaru | NTCIR14-QALab-PoliInfo-FormalRunDataset | /N |
Documentation:
https://github.com/kmr-y/NTCIR14-QALab-PoliInfo-FormalRunDataset/blob/master/README.md
Written
Corpus,
Language Type:
Bilingual
Languages:
English Japanese
Availability:
From Owner
License:
MIT
Size:
38,062 tokens Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
-
Paper title:English Recipe Flow Graph Corpus
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yoko Yamakata | English recipe flow graph corpus | /N |
Documentation:
Publicly available in English and Japanese
Written
Evaluation Data,
Language Type:
Bilingual
Languages:
English Japanese
Availability:
Freely Available
License:
Research Purpose Only
Size:
1000 sentences Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:A Test Set for Discourse Translation from Japanese to English
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Masaaki Nagata | Japanese to English Discourse Translation Test Set | /N |
Documentation:
None




